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WEB CQ?n^NTTRAN SXSTEBl AND l^tHOD FOR SMALL 

: ; technical Heidi 

5 . . The prfeSOTt ctoirt§nt motel, 

particularly^ to l wet ^Onteik trairi^ iof [ d ^sn^ , 

is prepiaifbl stut^ble for .'.a? ^%>lay p^fprm^ce 

on; at;^^<^^iay; 
Background Art 

"$e^ ^Jtat^nrt forms ^ 

: 'Sd|j^^ of ihfc 4eis^ ftfe* small 

display -'^©iri^ 

^splayde^ce/: 

20/ & order wid^it ; *cbnyert?rig '^^iod|s/]^ye * Iteeii . j 

. priopp sed: 4i o wey er, : s inc^as^ 

imtiai methods for si^pp^ or a low jpdfoimance PDA 

^^pnal Digits ^sist^t^ etc., tf^ inuch information cannot be well 

displayed- This is caiised by ia limit to a device p^ormance arid a main use of a wireless 
25 Internet markup language with a simple expression capability such as a text or HDML 
, (Handheld Device M^kupL Markup Language), etc. 

■ : The conventional ^ converting has a drawback in that since only ai portion of the 
V existing web information is extracted and converted, it is difficult to exactly convert a 
current romphcate-stractwed web page having a lot of images and information 
30 ; dinultaneoxisly expressed. . 

After that, as devices of the high performance PDA, hand-held personal computer, 
etc. have appeared, converting methods therefor have been continuously studied. As a 
result, a converting tool that operates in a server such as WebSphere Converting Publisher, 
Sypglass Prism, etc. manufactured by IBM has appeared. The converting tool uses a 
3 5 method in. which a web server manager converts through its manual work so as to more 
exactly convert a web content The. converting tool has a disadvantage in that noh- 
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automatic converting is performed, and a converting-rseiyed document is limited in its 
range comparing with an enormpns amount 6f the document on the wire Internet 

Pad-Hy etc. for providing a zbdm-Wzoom^ut fto Pad^Pfr, etc. hay e 

5 an advantage in that a device p^onnance can be mo*e exactly; understood and a user's 
requirement can be .e^ily. reflected, .but have ai^'inco^ thait after general ■ 

: urfpnnatipn on atp^ pkg^^^^ 
again checked for £ sufcst^^ 
" . at each portion of the page. \ . •.;>/ ;V\:.'; V ; '- J " : 

L 0 Fin&er, ; as ' tiie £ oriyerting methods fuhctiomng at. a prc&y ; serve*, there are; Top I 

Digester thai handheld ; pr\ ceflul^^ 

; ^ ^ p^orin^ 

tl^uj^ t>y a perso^ 

15 r - ^ /thwefpr; For 1 'exWt ^"ctihv^&g, /a; jplurality of complicated algorithms \ ^-t^^f^d'' 

information oii ^ the cony ©rtm^ ires&t is e^ 

etc. HoweV^ is inpphv^ 

; searqhiStiie to a limited itfojm a ^ cdmpiicSted Cfrtej^pry stractufe, 
aiid 4 use of a plurality H 
» 0 . Other ^nyCTtipnal arts are well known as disclosed in ^(^-tme internet content 

converting method and system* in Korean Patent Laid-Qpen No. 2002-3 1^91 (Applicatioh 
No. 10-2000-6662342), and in "Content forinulation system and method" ill Korean Patent 

v Laid-Open Nov 2002-15223 (Application No. 10-2000-0048415). Herein, the "Real-time 
internet content converting method and system'' uses a predetermined rule such that a 
:5 portion of a document content is extracted, page-divided or, converted into other markup 
languages. The converting into a document summary is merely performed and a document 
analysis method and a re-expression method are not disclosed in detail. Further, the 
"Content processing system and method thereof merely refers to a general construction of 
a converting system for the small device user of a wire web content. 

0 Accordingly, a conventional web document converting technique does not reflect a 

rapid improvement of the device performance, and means converting in a way of extraction 
of o nly a s pecific p ortion orac ontent s ummary, the c omplicated c ategory s tructure for 
expressing this, and the page-division and link-connection. A detailed proposal cannot be 
found for a clearly analyzing, converting and expressing method. That is, in most of earlier 

5 studies, the simple text summarizing converting is perfonned for the low performance 
cellular phone series device. Recently, high performance hand-held devices have been 
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appeared, but the converting for content reduction such as the cpntent summary, toe image 
deletion, etc/ is still mainstream. Or, a methpd for the page-diyision and tb£ page-link 
using link is provided, but in case a jink depth is deepen even without a substantial consent ■ 
summary, there is an incohvehimce in th^ difficult to be understood and 

5 a previous page is again ittiunedl 

Disclosure of the fty&tiQii? ; 

* Accordingly, the present InV^^ to systrai ^d method for parsing* 

imriti-docito oil elements, Vliich ; substantially obviate ; one or more of the ! 

LO problems <Wto linntktio^ krt-; : -: 

Accordm^y, the p^ 
and method for a 

j : V to lirnita^^ -f ; A , -y : -] t 

«'-, An object of thfe preterit invention is to provide a w#> content coijy^rtiiig system 
L 5 : and method for a small displa.^ device in which a cuireht web dbcumeiit i^cludinjg a lot of 
complicated irrforihatipn can be converted to reflect a cohtent of an original document to 
t'Vthe maximum and simultaneoiisly to have a cpnv^iiie^ 
• performance iniprovem^ . V " 

i Additional advantages, objects, and features of the mvehtibn will bb set forth in. 

2 0 part in the description which follows and in part yviU. h^rpk ^m^Ao those having 

ordinary skill in the art upon examination .of the following or may be learned from practice 
of the invention. The obj ectives and o^er adyahtages of the invention may be realized and 
attained by the structure particularly pointed out in the written description and claims, 
hereof as well as the appended drawings. 
25 To achieve these objects and other advantages and in accordance with the purpose 

of the invention, as embodied and broadly described herein, there is provided a web 
content converting system for converting a large display screen web document into a small 
display screen web document, the system including: a preprocessor for standardizing a. 
non-standard web document having an erroneous tag to output the standardized web 

3 0 document in a data format suitable for analysis; a client profile analyzer for extracting and 

managing client information; a structure analyzer for receiving the web document 
standardized in the preprocessor to set the web document to a content unit piece 
(component) according to a document analysis algorithm; an image converter for 
extracting information on an image encoding/decoding procedure and an image size 
35 included in the web document; a component block extractor for grouping the set content 
unit piece (component) to similar groups within a range not exceeding a maximal width by 
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using an attribution value of the. cpntent unit piece (component) and client performance 
information; a component block; categorizer for categorizing each of component bldcks 
generated by the component" ^blbck extractor jnto index and body content portions in 
accordance with a content ^ chatact^ristic; an index generator for extracting infonriation on ,, 
5 image or text index from the cx>mi>6nerit block categorized into , the index: portion, and 
generating a script file and .ah a&Moiial tag colieciion fpir expressing . the extracted 
i^or^tioi^ avbW 

r into a vbice tn^l^p lianguage' to perform a . voice si^ori^g f uiifction; and a fiyperText 
Markijp Language:^ for j-earriangiiig and recpn^ctnig' the generated 

0 content object elements accordmg fo; a docummt p 
screen Web document. 

In anbthfe} asjifect; of - .the present invention, ithere is! provided a wjeb content 

screen! Web document into ja small display \ 
screen Web doctoeni a ^ preprossesing step for standardizirig a nph- 
L5 standard WebV4oc^ dprbileous tag to output the standardized web 

document in. a data; format . suitable fpr analysis; ai web document analyzing step for 
• receiving the standardized web dojcument and aiialyzing a tag according to: a document 
analysis ^ algorithni to: sk the web document: to a; content unit piece (component); a 
: component block setting step for grouping the set content unit piece (component) to similar 
20. groups within a range not exceeding a maximal width by using ail attribution value of the 
content unit piece (component) and - client pearfonriance information; a component block 
: categorizing step for categorizing each of component blocks generated by the component 
block extractor into index and b<?dy ^content portions in accordance with a -content 
characteristic; an index generating step for extracting information on image or text index 
25 from the component block categorized into the index, portion, and generating a script file 
and an additional tag collection for expressing the extracted infonnation; a voice markup 
generating step for converting a text-centered body content block into a voice markup 
language to perform a voice supporting function; and a Hypertext Markup Language 
(HTML) generating step for rearranging and reconstructing the generated content object 
3 0 elements according to a document pattern to generate the small display screen web 
document. 

According to the above construction and method, the present invention provides a 
convenient interface in which at characteristic of the web document is reflected for 
simultaneously expressing a lot ' of current complicated information through the 
3 5. rearrangement by the content unit block, not the conventional information extracting and ; 
summarizing method, and a visual and auditory expression is simultaneously supported ; 
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without a left and right scroll through index gener^on.and categorization of the content 
unit block, and the converting into a format of a voice siij^ofting document, not a 
conventional method of an index-structure having more depths or jpagendivision; . 

Accordingly, in the present invention, a total \ web document can be .browsed 
5 . without the left and right scroll through the reairmgement ; bf.tfaie content iinit block, the 

extraction of the index blcK:k ^ 
: yr sizei of the display device, a more convenieiit interf ^e can^be provided by converting into 
the yoice supporting j^kixp r ^^^gQ in casb pf the ^tHfenterpd content body block, a 
: content of the; original w£b document can be reflected to the maximum by constructing a 
ip total stractoesmtablyfora^ 

It is to be understood -'tfatat/t^ 
detailed description 'of the presenting are; 
V '?1 intehd^ to provide further expiration of the invention as claimed. 

15 Brief Description of the Drawings 

The accompanying drawings, whiph ^e m^ 
; understanding of the invention and are incorporated in and constitute a part of this 
application, illustrate ^bodimeht(s) of the invention and together with the description 
serve to explain the principle of the invention 
20 FIG; I is an ek^plary view iUustrktm 

blocks different from one another through visual categorizing arid grouping; 

FIG. 2 is a conceptive view illustrating si module construction of a web content 
converting system for a small display device according to a! preferred embodiment of the 
present invention; 

25 FIG. 3 is a view illustrating an expression class relation of a table tag; . 

FIG, 4 is a flow chart illustrating an operational procedure of a web content 
converting system for a small display device according to a preferred embodiment of the 
present invention; 

FIG. 5 is a flow chart of illustrating a detailed algorithm of a web document 
3 0 analyzing step of FIG. 4; 

FIG. 6 is a flow chart of illustrating , a detailed algorithm of a component block 
setting step of FIG. 4; 

FIGs. 7A and 7B are exemplary views fpr describing a web document analyzing 
step and a component block extracting step according to a preferred embodiment of the 
35 present invention; . 
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FIG. 8 is a 'flow chart illmtrating a detaH^ algorithm of a component block 
categorizing step of FIG. 4; 

Fids. 9A aiad $>Ba^e exemplary y^ 
content accorc^g to a prefOT of the present invention 

Best Mode for Cariyin§ put the Invention 

Reference will tiow be preparedjin derail to the preferred embodiments of the : 
present invenition, example^ 6f winch are illustrated in the ^companying ^drawings. 
Wherever pdssible^ the same ^ reference numbers will hp used ^ througjiout the itowings to 

refer to &e same or y.; /'. % :L- :, \.,iy/ '";•;.'*>. 

M j is; an exemplary view illustrating a web docum 
blocks different tfom^ tt ; : 

Referring to FIGv 1, the w^ designed for a Viraal 

content haying a .meanmgful ^ ii. laybut cand -at structural tag such that : a 

manufacturer of a HTML (Hy^^ex^ the cohtent ; 

Most of the visual ^categorizations use the tog for. a^ "TABLE", 
etc., and accordingly, the tags can be analyzed to mder^and a total structure. At this time, 
some injudicious use of a tag collection and an unclear ^ categorization in a structure and .a : 
ineaning of the HTML itself are considered to utilize aici attribution value of the tag, a d;ata 
characteristic of the ta^, and position infoOTation. for expressing data.irtfq^ the tag 

object, etc. as well as the structo • 

: Through the structure analysis of the web document, a minimal content unit piece 
101 (it is called "component") constructing a visual categorization layout as shown in FIG. 
1 is set, and the content unit piece 101 is grouped considering a performance, particularly a 
display performance ,of the user device, and is expressed as a content unit block (it is called 
"component block") 102. 

The content unit blocks 102 are categorized into an "index" portion and a "cohtent 
body" portion according to a characteristic of the content, and are respectively re- 
expressed in a suitable format The index portion is re-expressed in a format of an upper 
selected box as shown in 121 of FIG. 9 A, which will be described later, and the body 
portion is merely rearranged without any converting ipto a main content portion as shown 
in 122 of FIG. 9A or converted into a voice supportable document format as shown in 123 
of FIG. 9B for expression. 

FIG. , 2 is a conceptive view illustrating a module construction of a web cohtent 
converting System for a small display device according to a preferred embodiment of the 
present invention, and FIG. 4 is a flow chart illustrating an Operational procedure of the 
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web content converting system for the small display device according to a^feferred 
embodiment of the present hWerition. 
As shbwn m M 

invention includes detailed modules 201 to 209 for performing operations of a 
5 preprocessing st^p (Si), a vveb document '^^yimg step (Si), a web ddcxmient converting 
.; step (S3) I and a web'd^ 

'\i Thepr^irocesfcin^^^ 

aiialyzer 202.7The preprb^ document including 

1 an erroneoiib -^ in a data format suitable for> 

L0 . analysis. The ^ fecieptioii .function bf *Mp&{ 
information. TheL cfidaf. i^^ 

traiismission or.c^utiH^a^ Besides, ian 

input^oi^ut m module is p^qnnbd in the ^ri^irbc^smg st^ ^ 

L5 In the web document analyzing step (S2), a layout-based structure analyzer 203 

receives the web document standardized in the . preprocessirig step (SI), arid the >veb : 

' . -■ / document is set to the content Unit piece (cqihpoiient) through a web document an^yzang 
. algorithm. An image converter; 204 extracts iiifbimatibn on an image ericqdmg/decodmg 
procedure and an image size of the web document ; ' '■■ 

20 In the web document converting Step (S3), a component block extractor 205 • 

performs grouping of the defined content pnit piece (component) to similar pieces within a 
range not exceeding a maximal width (MAX^WTOTH) of a single screen by using 
information on a client performance and the attribution value of the content unit piece 
(component). A component block categorizer 206 categorizes each component block into 

2*5. the "index" and "body content* 9 portions depending on the characteristic of the content. 

The web document generating step (S4) performs a procedure of generating 
necessary content objects. An index generator 207 extracts image or text index information 
from the index-categorized component block, and generates a script file and an additional 
. tag collection for expressing the extracted information. An auditory markup generator 208 

30 performs a converting procedure of a text-centered 'body content block into a markup 
language such as voiceXML, etc. so as to perform an auditory supporting function. At this 
time, a browser should provide a function of rendering the web document of auditory 
information to sound. Lastly, a customized HTML generator 209 suitably rearranges and 
re-constructs content object elements generated iri an earlier step according to a document 

35 pattern' to generate a customized web document. 
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FIG/ 4 is a flow chart for d^pribing aLtdtal. operational procedure of FIG. 2. 
Referring to the dra^gs, ^ oripd ^ ii^uttd to standardize .the HTML' 

document, iand then a data structure havi&g a;OT format is outjputted (401 to 

403), Thes6 steps ;»i££je^ module of FIG. 2> Intheweb 

5 document andyziag (Hf^ . 

tag, and this proc&diiffe is performed in the structure analyzer 203 and the image! cb&verter 
...2^ maiy2ang step 404 wiU be! • 

describedbel^ . ' ^ ' / : V .\' 

Aftte. to^'iag' toalj^g.st^,. a componmt'bii^k : setting' rtep 405^fe'-p^onn^ t m ; - ■ 
10 the component blb^ 

step 406 is petformjeii ill the compo^rat block 206 of FIG. 2. Each of the • 

algorithms of the pompbnentblock setting step 405 and the component block categorizing 
step 40$ is described vi^t^ charts 6f FIGsl 6 ahd 8. v : ; y 

. F & detailedalgori&m of the web document analyzing 

15 step 404 will be described as foJlpWis. . . ; 

The ianalysis ^gorithm of the present invention will be described for the case in ' 
which the tags such aii :^ a -7. 

: specific tag '-^^^is-de^f^^^ component to ;bie used as a miniin^ unit of ^ : \ 

unit analysis, .v . ]■".': : '\ ' ' * ' •* v' ■ - 

20 " First, a HTML< document tfe,e data structure is inputted, and the maximal screen ' 

* r width received through tjble user device is defined as the maximal width 'TWAX^WTOTH". ^ 
(50i, 502). In the analyzing procedure, information as in Table 1 is additionally stored in a 
; tag node <HTD> and is later ^ 

* • Table l - : ' . ' \ ■ ' \' '\ 



Variable 


Content 


width 


Width value being re-calculated in pixel unit 


Comp_num 


Value for expressing ID of component in case of setting to component 
General component: (sequence number,0,0) 
Nested component: 

(0, first number of Compjaum of first child, first number of Compjaum 
of last child)- 


Col_num 


Number representing at which column to position in layout of total table 
stracture. - ' : ' ; 


Row_num 


Number representing at which row to position, in layout; of total table 


a 
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structure 


Table_depth 

■ ■' • : • ' v ' • 
. *■,■'*'"* 


Reprobating number of ancestor tag node <Table> pf <1JLPV that is, 
depth of nested^table 



After ^ initialization for a global variable is ended in a step of 502, all of the tag 
nodes are visited in a Reorder ;^n^^\ : ^fail6}ttje following procedure are repetitively 

performed <505>> >"? ' £ 'V/r''--\'.'-', : ' l - v - ' ' v. v';'- 

M; case; of 4* . vtatk &dti : : $f&ig; <TABLE> tag (504), the iable depth 
(Table^depth) is checked (505), and in case of the critical value (e.g., 3) being exceeded, 
the ^able^ 'tag-'a^ are regarded as a general content to 

perform! only a width setting step (506) tvithpUt any further analysis. In case the table depth 
Xrable_depdiJ> does : '^'^tp^^'epti^ yalue (e.g.*. 3), £ value of the table depth 

(TaWdept^ « ;-f ; s .\'-> ;; " :*• 

'■ In case of the visited node being <Y9> tag (508), a . row number (Rpw^num) is 
increased (509): However, ih case of the first row of the nested table, the row number is 
not increased. Further; in case of the <TR> tag of the root table, a column number 

(Col_num)isinto . . 

hi case o f the v isited h ode b eing <OT» t ag ( 51 6), it i s d eternrmed whether the 
content is mcluded (511) to increase the column number (Coi^rium) (5 12). However, a first 
<TD> of the nested table <TR> is not increased. The width setting step 522 is performed in 
case the <TD> does not include the content for use in a layout expression, and the 
component is set and structural information is added in case the content is included. 

That is, the component is defined as <TD> tag block having the content. If the 
<TABLE> tag is included as a child among the component (513), set is made to the nested 
component to mark the. value of the component number (Comp_num) as shown in Taple 1 
(514), and in case tags other than the <TABLF> are included as the content, set is made to 
a general component to define a variable of the component number (Comp_num) as an 
increased sequence number (515). 

Referring to the expression class relation view of the <TABLE> tag of FIG. 3, a 
tag kind that can be included in the <TD> tag can be checked. Referring to the drawings, 
the table is categorized into TR and CAPTION, and the TR is categorized into TH and TD. 

hi case the visited node is <1MG> (516), the width is checked and then changed 
(517, 518). If the width is changed, if is checked whether the image map is set. If . the image 
map is set, a COORDS attribution value of an image map code <AREA> representing a 
coordinate value is modified using a formula of 520. In the width' setting procedure of the 
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step 518, a %-set value is exchanged into a pixel, the width is substituted with the maximal 
width (MAX_WIDTH) in case the Widttexcpeds the maximal width (MAX_WTO1II), and 
an analogy is made using the <TR> width, a sum of the <TD> width and a maximal : 
<IMG> width, etc; if the width attribution value is riot set : - 

5 FIGs. 7A and 7B are exemplary view for describing the web document analyzing . 

step and the component block extracting step according to a preferred embodiment of the 
. present invention. . / V. V/" V'.'*. 1 '\\^vV- 

Throu^^ 
the algorithm of FIG. Si^chiepked. V 

10 ; In FI<j. 7A iUusfeti^g the visual expression of the s^^ / 

/ . <TIC>, <TD^> block are expressed, and the coniponent is set for the <TD> tag block haying 
the content. Additional iiiforination is shown in the foUoTOng Table 2. In FIG: 7B 

T .expressing the tag coUec^^ of the structural tag, the class : 

telatiori ^ between the tags can b^ . ~"\r ;. 

15 Table2 . " ' , }\ ■ 



(A) 


Comp_num 


Rownum 


Coljaum 


Table_depth 


Width V..\. 


CD 


(1,0,0) 


1 


, 1 . ■ 




200 




(2,0,0) 


• .1 " •• 


'.: 2 


• ' i ■ ' •;• 


400 




(3,0,0) 




,' ; .3; 


1 


200 


® 


(0,4,7) 


2-5 . 


1-1 


1 


150 


: ® ;. 


, (4,0,0) 


2 




..... . 2 ; ■- 


150 


© 


(5,0,0) 




1 


. 2 . 


150 


© 


(6,0,0) 


•4 


1 


2 ' . 


150 


© 


(7,0,0) 


•• . 5 • • • 


1 


.2 


150 




(0,8,15) 


2-5 . 


2-4 ■ 


1 


650-* 
MAX_WE)TH 


® 


(8,0,0) 


2 • 


.' 2 


■ . 2 , 


650-* 
MAXJWIDTH 




• (0,9,14) . 


3-5 


2-3 


2 


400 


® 


(9,0,0) 


3 


2 


3 


200 




(10,0,0) 


3 


3 . 


' 3 ■ 


200 



to 
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n 1 ft ft^ • 


4 




.3 


9<Yri 




fl2 00V 




• 3 


:. ..V 3; '•• 


200 


. @ 


(13,0,0) 


: 3'.; 


'".2; • 


' ' • • • ; . 


• 200 


: ® . 


(14,0,0) 


.5-'./ 




.. ; V,3'" • . ; 


,200. 


• ©■ 


(15,0,0) 


■• ■ ". • 3 ' 






•-■ 250 ;• 




(16,0,0) 




1 




8^MAX_WIDTH 



In the above Table 2,^ ^(^^ 
(Compjnum) indicated in FIGs. 7A and 7B, and it is assmn^ thit the 
:. (M^iJ^Tfy isbeW 500 pixels; . ';:Y/- ;•■:■/ 

;5 ' ;Next, the component block btmcU^ 

reference: to the component utritby a single <ET)> of a sepiarate <^ABL^ fejg to be , 
insi^6d into toe sam^ 

With reference to FIQ. 6 and FIG, 7Bj the detailed algorithm of the component 
block letting step (405) will be described as follows^ 

10 '•' First, the component trbe (C^mpion^tjree) is inputted to check information ^o^ •/< 

initial width of all component nodes, and then the following procedure is performed when 
the maximal width (MAX_WIDTH) is exceeded (601 - 604). It is determined whether 
there is a sibling node of the current component node (A), and thai if there is the sibling 
node, a grouping procedure is performed for bundling similar sibling nodes within the ■' 

15 range of hot exceeding the maximal widfe (MAX^WIDTH) (605 - 60 

FIG. 7B, the component of (D, ©, (S) can be made to a group (CD),(<2)),(<3)) or 

(0®),(©). " / . • 

In the following table blocking step (608), all tag collection belonging to each of 

the groups are expressed as one table block in a format such as 
2 0 "<TABLExTR>Gomponent ®,®<TR></TABLE>". Or; if there is no sibling node, 

only the table blocking procedure of the component node is performed in the step 608. 

Li the table block rearranging step of the step 609, the table block newly generated 

in an upper procedure is inserted into a previous sibling node of the <TABLE> node (B) as 

the grandparent node of the (A). 
25: If the (A) is the last <TD> node of the (B) (610) and the (B) is the nested table 

(61 1), a next step is performed (612), and otherwise, a next node is visited to repetitively 

perform earlier procedure in a step 602. 
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The next step is perforated when the 0 y ©M of FIG. 7B are the (A), that is, the / 
component being currently visited. In case the uppd: ancestor <TD> having the (B) as the 
child, that is, this (d) ! is the nested component, the step 609 is performed. In other words, 
the ®,@ of FIG. 7B and e acho if the ( G) b ecbnies ® and @'\ With reference to the 
child node (701 of FIG. 7B) including the (B) ainong the child nod^s of the (C), all sibling : 
nodes at left arid right sides ^bundled by each of the table blocks (702^703 of FIG. 7B). . v 
Again, tihie t^le block iiic is generated (614), and the; step; 609 is repetitively, v 

performed , . •'•':';/ }-;.■■:;" ' ,v, \' \. - V- ■ 

* The component is extracted ais one exprc^ 
the extracted component is defined as the component block. Each of the component blocks 
has an arrangement sequence determined according to a position of the component on thb : 
tree, and is expressed in a shape of a table block, up to <JoWn depending oji the sequence! 

Ref ening cx>ntm 
categorizing step 406 will be described. . . 

The component block tree is inputted to visit all component blocks while the : : . 
content pattern of the component block is compared (801 — 803). At this time, a. usable 
comparative variable is arranged in the foUowingTable3. 

* Table 3 : : ■ y" V ; j. 



Variable 


Expected pattern ^ ' ; . 


TextJLength 


Similar repetition, limited short length 


hnage_Width 


Similar repetition, limited width 


LinkJNumber 


Almost all contents have link information. Comparing position o 
connected document, similarity of file name ' 


Row_num 


Limiting to small number. limiting to block arranged at upper stage h 
web document 


Col_num 


Limiting to maximal or minimal value! Limiting to block arranged at lef 
orrightside. 



Depending on whether or not a result value of the pattern comparison exceeds a 
certain critical value, the index type (INDEX type) is determined (804, 805). The 
component block determined as the index (INDEX) respectively sets a type value to an 
image index (MDEX_I) and a text index (INDEXJT) (806 - 808) depending on whether 
data type of the content thereof is the image or text V 
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The block not being the index. (D?0DE39 is categorized as the body (BODY), and is 
categorized as W voice body :(BpDy_>^ type for inverting into a voice supportable 
document , and a general' • body (BODY-b) processed as other general content blocks 
according to a relative importance of the text to the content included 009 - 812>. In case 
of not bemg toe last block in the step 8l3, ; a procedure is fettmli.^&ag- fiom toe' step 
•802 for toe next block. .;• ; \, .. •• •.. \ ■■ ;. • ;"*■.'. 

The'ait^ 

chart sno\^g a"^ : 

Referring ; J&: component block is ^categorized (467-409, 

412),, toe steps: 41 1, 413, 414, of FIG: 4. arepe^o^ed br toe- component block is well 
extracted (410) according^ 'to toe ^ type of each, cotopon^ block. This procedure iis 
performed: for aU coiniponent block (415), and each of toe blocks are ; smtabiy aj^ged in. 
toe laststep 41 6. to generate anew^H^ An bperatioh'procedure b 

type of toe component block will be : described as follows. . • ■ : . • 

If toe type of toe component block is toe voice body (BODY_y)(Type ==- 
BODYV), toe voice document generating step (4i 1) is performed to generate toe voice, 
supporting document This is performed in toe Voice markup generator 208 module of FIG: 
.2, and all text portions can be added as toe <prompt> value as in a sample code of toe 
following Table 4 in toe block to generate, a simple VoiceXML document. The generated 
document is stored as a separate file andjs cotoaected witoalii%m^ 

Table 4 

<?xml version^' L0"?> 
<vxml version-' 1.0"> . 
<form> 

<block> 

<prompt>- 

(Adding text information extracted from Blcbk categorized as BODY_V, 

to value) 

</prompt> 
<disconnect/> 
</block> 
</form> 
</vxml> ..' 
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. Herein, if the type of the eomponent block is the general body (BODY^G)(Type 
== BODY_G),:it is extracted well for rearrangement due to the general content element 

If the type of the compbnent block i[s; : the image index (INDEX_I)(Type = 
INDEXJT), the image index (image Index) expressed in the Java Script through the image : 
index generating step (413) is generated. As in an example of a sample code of the 4 
following Table 5, a simple script fUe is automatic^^^ image file i$ 

; mapped for its iembbdirtent 

Table 5 

// jayascript filled into HEAD 

• ;<H ' 

; imagel==. new ImageQ; 
image! :sirc "iinagei.gif *; 
image2= new lmage0; 
image2.sarc = "image2.gif ' 
image3^ new ImageO; 
image3:src == "imageSLgif 1 ; 
image4^ new ImageO; 
image4.src = "image4^f f ; 
: links = new Array; 
links[0] = "LlNK#r , ; 
links[l] = "LINK #2"; 
links[2] = "IJNk#3"; 
linkst3] = "LINK#4"; 
function imgchange(){ 

var imageNum = documentfonn.sellmage.selectedlndex + 1; 

fhame = eval("image" + imageNum + ft .src lf ); 

documentimg.src = fiiame; 

> ' .'■ ' . : ' • " •• • 

function go0{ 

location = links[documentfonn.sdfo 

function showlink(){ 

window-status = Unkstdocument.fonn.seUmage.selectedlndex]; 

}-. ' '' • 
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<JSCBSPT>i ': 

II form tag filled intopODY : . 
. <FORM name= <, f6rm">. •;, 

<SEI^CT NAME=; M selInia^e" size=l 6nCftange="mgchange0f > 
<OPTI(^>lhdex 1 V 

<0^Ol^iadex 2 

<OmON>itadex 3- 

<OPTi0N>lndex 4 ; 

«^SELECT> "'" 

^QRM> 

'-<i. href=f'» baClici^"g60yreton false;" o^6t^eQVe^ M shbwlifikQ; retum lrue;" 
ohMouseGtat=?wmd 
• <IMG SRC=="imagel .gif ' NAME="img M bordeH><^a> 

Herein; the type of the cpinppnent block is the text index (lNDEX_T)(Type = 
INDEXJT), the index information is expressed as the text and is re-expressed using me 
<select> tag as shown in the following ''Table 6 through the text index generating step 414, 
The image index generating step (413) and the text index, generating step (414). are 
performed in the index generator 207 module of FIG. 2, and the index information can be 
extracted in a general manner. 

Table 6 

//javascript filled into HEAD 
<script language^" javaScript"> 
<!- ' 

function change(form){ 

var Ust==form;selectedlhdex; 

location type= i =form.opti(ons[list] .value; 

// location type is selected among me fofiowings 
// - self.location.href : linking to frame belonging to oneself 
// - top.locati6n.href : all screen is changed irrespective of frame 
// - parehtlocatipn.href : parent frame including oneself is changed 
// - parent, framename.locatiori.href : linking to child frame having selected name 
among parent frames 
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form.sel©sfedlhdex = 0; 

•; • - • ". . >. 

• //->•.. 

</scripP» 

// form tag filled into BODY. 
<fonn ^^e^ h iFo^nnaime ,, meth6d= , 'get"> : 
. ; . . <seleqt niMe^"form* ? 6n#^ 

<^ti6h selected>index ^s^op^oh> - / 
10 <bptio.n yaluep"liii #l">md«i: i</6ption> ■* 

<optibh ^valueH"iiinic ^ h >index 2<optibn> 
<optiori value=r"link #3 ">index 3</option>; 
, <^seiect> ; 
<yfonn> 

15 . 

. After each component block is expressed in an appropriate method according to 
the content characteristic as described above, the content object is arranged and generated 
through the new HTML constructing and generating step 416 performed m/me. HTML 
; generator 209 of FIG. 2. The sample code of the following Table 7 provides a tag 

2 0 construction of a total HTML and a simple arranging method of each content object 

• ' ; " Table 7 "', 
'., <HTML> 
<HEAD> 

<TriLE></TITLE>, 

25 <SCRIPT> ~> enclosing script file automatically generated by Java Script 

Generator module. 

This is added in case Image Index is generated 

• <^SCRIPT> 
</HEAD> .'" 

3 0 <BODY> --> Attaching Component Block categorized into INDEX_T or 

BOD Y1G into BODY tag. 
<SELECT> 

<OPTION> ~> generating select Ust form as many as Text Index and arranging 
respective values with. Option tag. . 

35- </sELEct> ';■ 

<TABLE> 



WO 2004/040467 



PCT/KR2003/002322 



<TR> 

<TD> OTan categorized into 

BODYJ3 as value of TABLE tD. At this timb, width of total table/newly generated is 
■ determined according to display p^bimm^ in client. profile. 

5 <IMG : src=^ href : = , ;^*.ianl ,, > listening to content (Title) \ 

<yA> -> connect BOD Y^V block iipnvibrtiBd ^to VoiceXML 
, '<^>\ \'/[ '. 

• • ; ■ y<^pi>i-'r- : v 

V. </TABLE> 
10 </BODY> 
</HTMp> 

The inventive content conv^i^g systeni as described above ; can be put on thrfee 
layers of a web server, a client, aind a proxy/ and respectively has merits and dements 
15 depending on its environment Further, the extraction algdrithiia .'of ibe component and the 
component block can be embodied, in various, methods, and ftprther, an index generating 
and voice? document generating method is exemplified as one of several einbodying; 
methods. : - \ , ' '/ 

FIGs. 9 A and 9B are exemplary view illustrating a converting result of the web 
2 0 content according to a preferred embodiment of the present invention. 

FIG. 9A illustrates a resultant page of the web document Converted t^^ 
, rearrangement of the content unit object and the index extraction, and FIG. 9B illustrates a 
resultant page representing in case the voice supporting markup creating function is added 
to the resultant page of FIG. 9A. 

25 

Industrial Applicability 

As described above, the present invention provides a new technique and system so 
that the web document prepared to be suitable for the display performance of the existing 
general desktop personal computer is converted to be effectively expressed even on the 

30 small display in case the user of the small display device intends to use a web service by 
connecting a wireless internet According to the present invention, the web document is. set 
to the content unit piece by analyzing the structural tag information, and is bundled into a 
similar content unit group and then categorized into the index or body content on basis of 
the content information for rearrangement such that a function of browsing with a 

35 convenient interface without left and right scrolling for a total Web page is provided. 
Further, the extraction and the generation of the. index and the converting of the voice 
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supporting web^document are also, provided, together to provide varibus reconstructions of 
the web document and an expression effect considering the chaiacteristic. of the small 
device: Further; ail effect caii be also ob^ned foir maintaining the content of the origiiial 
document tp the msprimum 

It will be apparent tb those skiUed in the art that various modifications and 
variations can be prepared in tbe present inventioibL Thus, it is intended the present 
invention covers the ^modifications -ajad variations of this invention provided they <come 
within th^ scope of ffiie ^peni^ 



